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Aw. EEBCBIES 


Current military policy iacludes an exfansion of the 
Navy fxrcm the present level cf approximately 500 shifs to 
apprcximately 600 ships by the end of the decade. This 
expansicr is being conducted at atime when the frimary 
recruiting pool, 17 - 21 year old males, is declining. There 
are two avenues which must be pursued to prevent a shcert 
fall in personnel tc man the expanding fleet. First, the 
Navy must retain high guality personnel and secondly, the 
Navy must recruit sufficient Rhumbers of high guality 
perscnnel tc replace attritions and simultaneously meet ever 
increasirg wanpower requirements. 

It is not enough just to recruit warm todies tc fill 
vacanciés. The ability cf the American serviceman to 
adequately operate and maintain equipment of increasing 
sophistication has been widely debated in Congress and the 
press. Stcries abound of the expected deterioraticr cf 
eguipfzent readiness during periods of increased tensicrs in 
the event of the loss of civilian personnel. To maintain an 
effective fcrce, the Armed Services must be able to operate 
and maintain its e€guipment independent of nonccmkrEatant 
Civilian technicians. In order to achieve this goal, 
Ferscrnel with the akility to learn the operation and main- 
tenance of ever nmcre Scphisticated equipment must be 
cbtained and retained. 

Cne€ avenue proposed to alleviate a manpower shortfall is 
an e€xfanded role for women in the Armed Forces. This would 
seen, cn the surface, to be a viable solution to the preblen 


Since wcuen have been historically less expensive to recruit 
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and retain than wen of ccmparable guality [Ref. 1). 
However, the Navy is frecluded from taking full advantace of 
this scurce of manpcwer due to public law precluding the 
Fermanent assignment of women to combatant ships. This 
requires focusing the recruiting effort towards high quality 
rene The definiticn of "high guality" has been widely 
debated. For the purycses of this study, "high quality" will 
ke defined as high school graduates in Mental Grourfrs I 
through III upper as measured by test scores on the Arzred 
Forces Vccational Battery of tests (ASVAB). 


F. ‘FCCCS 


This tkesis 1s concerned with develcfing a mcdel to 
forecast the number cf high guality nonprior service rales 
signing ccntracts cn the recruiting district level. The 
dependent variable is net contracts signed: CNTRCTS. Values 
of this variable are determined by subtracting Delayed Entry 
Frogram (DEF) attrites during the month of attrition from 
contracts signed during the same month. 

Tkree approaches to forecasting CNIRCIS are taken. 
First, the Box-Jenkins methodology 1S applied to a mcrtkly 
time series of the net numrer of nonprior service high 
schocl graduate malés signing contracts in the Alkuguergue 
New Mexico recruiting district during the feriod Octoter 
1978 tc September 198&3 (60 data points). This methcdclccy is 
explained in detail in Aprfendix A. The second affroach 
utilizes "kest fit" lagged variables ina block regression 
format tc provide a causal model using economic data, the 
number cf froduction recruiters and military/civilian fay 
ratic as explanitory variables. The third frocedure 
combines the causal acdel and the Box-Jenkins methcdology in 
a mixed time series/causal model to attempt to provide mcre 
accurate forecasts. 


C. TEE ENVIRONMENT 


This thesis will limit the scope of study to a single 
recruiting district. It is hypothesized that better esti- 
mates of the effects of changing economic factors and the 
effects cf the variation in the number of production recrui- 
ters w#ill ke oktairned at this level than ona national 
SCOPE. During any change in the general trends of the 
naticn's eccnomy, local conditions are apt to vary consider- 
ably ficm recruiting area to recruiting area. It may Ee that 
the eccnczic trends in a particular recruiting district are 
in direct cpposition tc those on the national level. f&khile 
goals and guotas are specified at a national level, recruit- 
gent cf individuals cccurs at the district level. The Navy 
Recruiting Command (NRC) allocates a fraction cf the 
national gucta to each of the Navy Recruiting Areas (NRA). 
The six NRA'S in turn allocate fractions of their guota to 
€ach Navy Recruiting District (NRD) within the aré€a. 
Bistcrical enlistment rates and differences in regional 
Fopulaticn are primary determinants of guota alliccaticn, 
althcugh eé€conomic ard demographic factors are also taken 
into consideraticn [kef. 2}. Information ottained froma 
regicnal study, through a better understanding of the inter- 
acting flrocesses, fray be cf more practical value in allo- 
cating recruiting rescurces for maximum benefit, and in mere 
eguitakly assigning recruiting goals. 

Tke recruiting district chosen for study is the 
Albuguergue, New Mexico recruiting district. This district 
was chcsen at random from the 44 recruiting districts nation 
wide. This district encomfasses the entire state of New 
Mexico, the panhandles of Texas and Oklahoma as well as 
seven ccunties in scuth western Kansas. In general this 
district experienced an increasing work force, increasing 


unemgplcynent and a Significant variation in the numker of 


oe 
tad 


Froducticn recruiters during the period of study. Aprendix B 
contains tke time series plcts of the variables. 

Backgrcund demographic factors for the area that are cf 
interest are as follcws. The average per capita income for 
the state cf New Mexico in 1979 was $ 6119, 43 rd in the 
naticr. The populaticn in 1980 was 75% White, 2% Black and 
@% American Indian. 37% of the population was of Spanish 
crigin.e 13% of the fopulaticn was between 18 and z4 years 
cld [Bef. 3}. 
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A. CABLING (1979) 


In his thesis on the supply of first term enlistees to 
the United States Marine Corps, Darling, [Ref. 4], [rcevides 
a detailed review of the literature on the supfly rcdels 
develcred from the Gates Com@mission (November 1970) tc the 
Fublicaticn of his thesis (March 1979). The fcllewing 
review will cover tke pertinent literature commencing with 
Carling's werk. 

Darling used tre distinct analytical technigues to 
develcr mcdels in crder to forecast monthly first tern 
enlistees in the United States Marine Corps. A_ tultirfle 
Tegressicn model was derived based onit's compatitility 
with his theory cf occupaticnal choice, the intuitive arreal 
cf certain explanatcry variakles, the past literature of 
Manpewer supply and the statistical significance of each 
variakles impact on monthly enlistments. The data base 
utilized was a time series of monthly enlistments spanning 
the pericd from July 1973 to June 1978. 

Fer his multiple regressicn model, Darling chose as the 
dependent variable; monthly cbservations of the fratio of 
tale nonpricr service high school graduates enlistirg ir the 
regular Marine Corps to male high school graduates, aged 16 
to 24, net enrolled in college (S). Darling chose as the 
independent variables: (1) Civilian - Military pay ratio 
(CMPR), (2) a deseascnalized estimate cf the naticnal unen- 
Floynent rate for persons aged 16 to 19 (UNEM), (3) the 
monthly numker of qualified leads obtained from festcards 
which were included in the national printed media advertise- 
feats (LEADS), (4) a dummy variable to correct for the 


abnorgally large nugrter of enlistments inthe uacnth of 
Lecemrer 1976 to take advantage of the expiring GI Bill (TJ) 
and (£) the numker cf Marine recruiters on duty each nonth 
(EFFREC). His initial hypothesis was that the numker of 
gonthly enlistments is dependent on the above indeperdent 


variakles ir the functional form of a logistic model: 


S= (+ ex - (Bote, CHEE 


#B,EFFREC +B,UNEM +B, LEADS +B-DJ +a)| “1 
2 3 4 5 ) 


Coly thcse regular enlistments of male high schocl grad- 
vates in the top two mental groups were considered tc ersure 
that tke crserved rumber of acathly enlistments was not 
restricted Ly Marine Corps pelicy (demand driven) and was 
therefore ar accurate representation of points on the supply 
curve. All the indéfendent variables listed above (except 
[CJ) were lagged fren one tc six months and regressicns were 
Tun te disccver those variables that significantly ccntrib-. 
uted tc the pattern of enlistments and to ascertain the. 
apprcximate delay in the impact on contracts signed ofa 
change in the value of one of the variables. The final 
regressicno eguation selected was based on the intuitive 
appeal cf the regression equation and the statistical 
Significance of the results. The best fit equation was 


determgined to be: 


me 


1- 
wheres 
S = the enlistazent rate 
CMFE5 = civilian - nailitary fay ratio 
lagged five months 
UNEM = uremplcyment with no lag 
DJ = dummy variatle for December, 1976 
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a = random error. 


all independent variables were reported to be signifi- 
cant at the 10% level with a coefficient of determination, 
R-squared value, of .42. Of significance is tae exclusion 
of the variable representing recruiters. Darling reported 
that when this variable was forced into the regr2ssion equa- 
tion, its' cregressicn coefficient was never statistically 
different from zero. Darling also reported a Durbin-Watson 
statistic of 1.08, indicating a positive serial correla- 
tion among the residuals. He felt that this was probably due 
to the lack of an attempt to explain the seaSonal nature of 
Marine Corps enlistments to this point in his study. In an 
attempt to correct this problem, the Box-Jenkins methodology 
was applied. 

Darling utilized the data from the multipl2 regression 
model ina Box-Jenkins procedure to arrive at a final model: 


Vp = -804251Y 4.) + ¥p4.--8042 51% 45-7. 095902 +e4-.478727e 4. \9 


Where Y was the number of male high school jraduates in 
mental groups I and II who enlisted in month t and e was 
the error term. Otilizing this model, he conducted two 
forecasts with varying degrees of success. The first fore- 
cast conducted incorrorated the abnormally hija enlistment 
month of December 1976. This resulted in the distortion of 
the forecasts. For the second forecast, not iacorporating 
December 1976, the pattern of enlistments was well repre- 
sented by the model. 

Darling concluded his thesis by combining the _ two 
methods in applying Box-Jenkins to the residuals of the 
multiple regression analysis as if they were an original 
time series. He then used this combined model to forecast 
Marine enlistments with a greater degree of azscuracy than 


with either model individually. 
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E. CALE AND GILROY (1983) 


Cale and Gilroy ir two studies of the economic deterzi- 
nants cf military enlistment rates [Ref. 5 and 6] utilized a 
regressicn equation to forecast enlistment rates cf male 
kigh schccl graduates in mental groups I through III uprger 
for tke Aruay. The dependent variable was total ccntract 
signers divided ry tke relevant Civilian populaticn cf 16 to 
19 year clds. Their basic equations were estimated witha 
linear functional ftcrm using generalized least squares, 
correcting for the presence of first order autocorrelaticn. 


Their eguation fcr all nales was; 


A/P = -1S€E&.90 + 16.29 UM + 7.77 UM-2 + 4.59 UM-4 
+ 40.89 WH + 313.07 BILL + 408.40 VEAP 
¢ 6.26 KICK + €&4.65 Q3 
+ 819.73 GI + 148.38 TARGET —- 0.00 RECRTR 


wheres; 

A = Army accessions plus DEP of male 
NES high school grads in mental 
groups I through III upper. 

P = Revelant male population of 
16 to 19 year olds. 

UM = Current unemployment rate of 
16 to 19 year olds. 

UM-z = UM lagged two months. 

UM-4 = UM lagged fcur months. 

W4 = Ratio cf first year basic military 
ccmpensaticn to average 
weekly earnings in the private sector, 
with a four month lead. 

GI = Dummy variakle = 1 in December, 1976 when 


ie 


GI Bill expired, and =0 everywhere else. 


Maximum monthly benefit for a GI bill 


Brit 
beneficiary without dependents, deflated 
Ey the consumer price index. Variable 


is set to zero after December, 1976. 


VEAE 


Maximum monthly benefit for a VEAP reneficiary 
without derendents , deflated by the consumer 
Frice index. Variable is set to zero before 
January, 1°77. 


Maximum value of kicker payment (offered 


Kick 
cnoly te Arry enlistees who entered critical 
specialties in combat arms) deflated by 
the consumer price index. 


TARCEI= Einary variatle = 1 from November, 1979 to 
August 1981 when high school grads were 
Specifically targeted, and 0 elsewhere. 


Q3 = Seasonal dummy variarle = 1 in July, August 


and Serptemlrer, and = 0 elsewhere. 


RECETR= Number of froduction recruiters. 


As with Darling's work, of some interest is the insig- 
nificant statistical results for the number of frcduction 
recruiters. Dale and Gilroy do not conclude that recruiters 
are urimportant, bet that the variation in the rumker of 
Froducticn recruiters has net Feen large enough tc shcw any 
correlation with enlistment rates. They include the variatle 
"TARGEI" as a proxy fcr the recruiters! efforts. 

In their study tkey determined that a rise in the unen- 
Floyment rate of 16 to 19 year old males led toa signifi- 
cant increase in Army enlistments and that a drcfz in 
Naticral unemployment from ten percent to nine percent sould 


Us 


cause Aragy enlistments of male nonprior service high schcol 
graduates to fall cy about 8.8 percent. They project tkat 
the elasticity of tctal Army enlistment rates with respect 
to relative pay is ir the range of .9to41.7 fercent for 
mental] grcurs I to III upper. They concluded that a rela- 
tive decrease in military pay cf one percent in relaticna to 
Civilian fay would cause enlistment rates to fall sukstan- 
tially. Also, they found a statistically significant 
increase in the rate of enlistments during the feriod 
Novemkter 1979 to Augtst 1981 when the Army was congzitted to 
increasing the percentage cf high school graduates. 
Finally, they ccncluded that educational benefits are very 
important tc many high school graduates, including thcse in 
the higkest mental grcups. 


C. GCLIEERG AND GREEBSTON (1983) 


GCcldkerg and Greenston [Bef. 7] provided a frogress - 
report cn on-gcing time series cross-sectional analysis 
research into the determinants of high quality enlisted 
Supply. They ccntended that utilizing a regional data base 
rather than a naticnal afppreach permited more accurate 
twodeling of demcgrarkic factors and produced better esti- 
mates of the effects of changing labor market conditions. 
They asserted that Ey utilizing a regional approach, a 
greater capability tc optimize the allocation of gcals, 
recruiters and advertising is achieved. 

A regression analysis was used to estimate the effects 
cf supply factors cn the number of contracts signed by 
nonprior service sale high school graduates for each 
service. Annual data for the time pericd 1976T to 1982, 
Using 43 Navy recruiting districts as units of observation, 
was evaluated. Sererate regression models were develored 
for mental groups I tc III upper and mental group III lcwer. 


They assumed the suyfly of HSDG contracts in a recruiting 
district to be a lcg-linear function cf supply factors in 
the fcrm of: 


Lo B = 1.£1 + «75 LEFAY + .05 RUNEM + .15 LAVG 
- 0.26 VEAF + 0.43 LEOP - 0.0089 BLK 
¢ .0023 UREAN + 0.58 LNREC + Error tern 


Where; 

LREAY = Icgarithm cf regular military compensation 
cver a four year feriod, divided by full 
time eguivalent earnings of civilian youth. 
Military and civilian earnings are discounted 
at 30 percent. Civilian earnings are aged 
with data cr median weekly earnings of 16 
tc 19 year clds. 


RUNEM = Change in the uneuplcyment rate for all 
Civilians. 

LAYVG = Icgarithm cf the average unemployment rate 
fer all civilians in FY 1976-82. 

VEAE = LDummy variatle egual to zero in FY 1976T 
and one in FY¥'s 1977-82. It measures 
tke net effect of the change over fron 
the GI Bill to VEAP. 

LFCF = Legarithm cf a district's 17 to 21 year old 
pale population in thousands. 

BLK = percent of a district's 17 to 21 year old 
male pcpulation that is black. 

UFCE = Fercent of a district's 17 to 21 year old 
male pcpulation that resides in a urban area. 

LNRFC = Icgarithm cf Navy's production recruiters. 
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In the results cf their findings for mental groups I 
through Iil upper, they ccncluded that relative silitary 
pay, cyclical unemplcyment, total population, urban mix and 
recruiters increase enlistments, while black populaticr and 
the lcss of the GI Bill cause enlistments to decline. They 
deterained that a oné€ percent increase in relative rilitary 
Fay would cause an increase of Navy enlistments cf 0.75 
Fercent and that a ore percent increase in the unemglcyment 
rate would increase enlistments by between 4.3 and 5.1 
percent. Although they indicated that their results 
utilizing regioral unemployment were i1inconsistant, it 
appeared that ic general arise in regional unenplcyment 
caused a rise in enlistments in the region. 

They fcund that the loss of the GI Bill caused a decline 
in the suprly of target candidates of 26 fercent tc the 
Navy. TItey additionally determined districts having greater 
Fopulaticn tended to have mcre recruiters resulting in their . 
separate effects beinc difficult to measure. They did deter-_ 
Mine that a doubling cf recruiters and population results in 
slightly ucre than a doubling of enlistments, and that a cne 
Fercent rise in the urban population increased enlistments 
ky one tc two tenths cf a percent. Overall, they ccncluded 
that the research apfroach using annual time series, cross- 
secticnal cata was fruitful and that additional fresearch 
will enhance the usefulness of this model for forecasting 
and pejicy analysis. 


III. AEELICATION CF BOX-JENKIN 


A. ARWALYSIS STEF OBE 


The Box-Jenkins methodology was agplied toa acrthly 
time series of nonprior service high school graduate rales 
signing contracts in the AlEuguergue, New Mexico recruiting 
district during the feriod October 1978 to Septemrer 1583 
(60 cata feints). Ihe data was limited to those ccntract 
Signers ix mental grcups I through III upper and represented 
net ccntracts signed, that is new contracts signed less 
Delayed Entry Progran (DEP) attrites. Figure 3.1 1s a time 
series flct of this data. This plot suggests strong seascn- 
ality in the number cf high quality male enlistes with peak 
months appearing to ke in late summer, predominately August, 
and in January. An "average" year was constructed by aver- - 
aginc tke individual months over the five year _ fericd. 
Figure 3.2 is the tine series flct of this average year. The 
averace for each montrE is denoted by the letter "A". Also 
depicted in the graph is the average of the contracts signed 
in tte first three years of the data series denoted Ly "1" 
and tte average of the last two years denoted by "2". The 
average cf all the years ,"A", tends to support the assunp- 
tion cf seasonality suggested in the time seris flot cf the 
data set. However, in the flot cof the last two years alone, 
the suggestion for seasonality is not as clear which would 
tend to indicate that some frocess change has taken flace. 

Cf scme€ interest is the afparent sudden increase in the 
mean numter of contract Signers over the approximate last 
third cf the series. Further analysis indicates that for 
the time series aS a whole the mean number of ccntracts 
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Signed per month was 49.07. For the approximate first two 
thirds of the series the mean was 42.83 and for the approxi- 
mate last third the mean was 62.53. Because of this unusual 
pattern, two additional time series plots were constructed 
Or this data. The first was a time series plot for the 
first forty one data points (October 1978 to February 1982), 
figure 3.3, and the second was for the last nineteen data 
points (March 1982 to September 1983), figure 3.4. Although 
no obvious trend is apparent in either data subset, 


both figures io provide some indication of seasonality. 
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Figure 3.2 Average Recruiting Year. 
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Ihe rext step in develoging the models for high quality 
recruitment was to slot the autocorrelations for the data 
sets. Figure 3.5 is the autcccrrelation plot develcred by 
the Minitak general [urpose statistical computing syste for 
the fericd Cctober 1578 to September 1983. An examinaticn of 
these autoccrrelatiorns indicate that non-stationary Leéhavior 
in the frocess level (a trend) is evident in the the data. 
The evidence of such behavicr is indicated by having auto- 
correlations which dc not die cut rapidly to zero. Altkough 
the autoccrrelations for this data indicates a general tread 
in the cata, the tinae series flots and the mean difference 
in tke two data suksets tend to suggest not an cverall 
trend, krut a sudden and sustained overall increase in the 
number cf ccntracts signed [Ey the target cohort. This gay 
ke caused Ly some yet to be determined external influence. 
An evaluation of this sudden and sustained increase in 
contract signers wili be addressed in Chapter Four. 

TIwc separate courses of aralysis were fursued. First, in 
the event that the cause of the sudden increase in the mean 
could net te identified and guantified, a model which sas 
adequate tc describe the data set as a whole was sought. 
Second, in the event that the cause could be identified, 
its" affect quantified and any future occurrance predicted, 
a single mcdel which satisfactcrily described the underlying 
data patterns of eack subset around their mean was sought. 
If the cause of this sudden change could be deteruired or 
contrclled, then the second mcdel could possibly be used to 
more accrrately forecast future contract signers. 
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Figure 3.3 Monthly Ccntracts (Oct 78 - Feb 82). 


E. AUICCORKEELATIONS ABD PABTIAL AUTOCORRELATIONS EXAMINED 


1. Cctcber 1978 to Septemter 1983 


As evidenced by figure 3.5, there is a suggested 
trend for this data set due to the increase in the mean 
number of ccntracts signed. As the Box-Jenkins methodclogy 
requires stationary cata to be effective, differencing! was 
used. This data set was differenced once and the resulting 
autoccrrelations and partial autocorrelations, as repre- 
sented ir figures 3.€ and 3.7, were examined. 


1Pifferencin 4, thes Second by subtracting tke first 
value frcg the gs cond , the ond from the third, the third 
from the fourth, SEce Differencing is nbe most rapid method 
cf remcving a trend eee oR a time series [ Ref. oa 
Tifferencing may be ferformed once or more in order oO 
Pacts, the trend, hcwever differencing once is usually 
adequate. 
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Figure 3.4 Monthly Contracts (Mar 1982 tc Sep 1983). 


Three statistically significant data pcints are 
cbserved in figure 3.€: the first, the twelfth and the thir- 
teentk. Cverall, this plot suggests that the trend has been 
removed and that only horizental, stationary, data remains. 
The strorg positive autocorrelation at the twelfth period 
supperts the seasonality suggested by figures 3.1 and 3.2 


ze Cctcber 197&€ to Fekruary 1982 and March 198z to 


Figures 3.8 through 3.11 are the autocorrelaticas 
and partial autocorrelations fcr the periods Octoker 1$78 to 
February 1S9€2 and March 198z tc September 1983 respectively. 
These slets suggest that both subsections of the data set 
are trendless, with a suggesticn of seasonality evident only 
in the Cctokrer 1978 tc February 1982 data. Identification 
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Figure 3.5 
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Figure 3.6 


AUTCCORRELATICNS OF DIFFERENCED DATA. 
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PARTIAL AUTOCORRELATIONS OF DIFFERENCED DATA. 


Figure 3.7 
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ACTIOCORRELATICNS (OCT 78 - FEB 82). 


Figure 3.8 
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PARTIAL AUTOCOREELATIONS (OCT 78 - FEB 82). 
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AUTOCORRELATIONS (MAR 82 - SEP 83). 
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Figure 3.11 PARTIAL AUTOCOREELATIONS (MAR 82 - SEE 83). 


C. SCLEL DEVELOFMENT 


In ccmparing the autocorrelations and partial autcccrre- 
laticne to the varicus ARMA models in figures A.5 through 
A.13, a mOving average model 1s suggested for the data set 
for Cctcber 1978 tc Septemker 1983 as well as fcr each 
subset of the data. HEcwever, which particular MA model would 
kest descrikre the underlying data pattern in each case is 
unclear. 


1. Cctober 1978 to Septemkrer 1983 


faking into considération the apparent seascnality 
cf the data set, tte apparent trend in the data anc the 
uncertairty as to wkich specific MA godel would Ee acest 
apprerriate , twce models were developed for evaluaticn tased 
ch a visual inspecticn of the autocorrelations and partial 
autcccrrelations. 


ta) 
A> 


ae First Order, Seasonal, Moving Average Model with the 


foLla: 


Yeates AX,= (1- 0,8) (1- 6,822) e, (eqn 3.1) 


k. Second Crder, Seascnal, Moving Average Model with 


tte form: 


y -y. =Ay, = (1-6,B- 0,82) (1-6,B!2)e, (eqn 3.2) 


Where € is the error tern. 


In developing the mcdels to be tested for this data 
subset, the absence of an apparent trend, the suggested 
seasonality, and the uncertainty aS to which particular 
moving average model would best describe the underlying data 
patterns were taken into consideration. Two models were 
selected for evaluaticn based on a visual inspecticn cf the 
autoccrrelations and fartial autocorrelations. 


a. First Orcer, Seasonal, Moving Average Model with the 


rer i: 


ee (1-6, B) (1- O+b2 <)> (egn 3.3) 


rE. Secend Crder, Seasonal, Moving Average Model with 


the form: 
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Y. = (Ie BG be) (1 em eeeee (egn 3.4) 


Where € is the error tern. 


tad 


- arene 


tc September 1983 


In developing the models to be tested for this data 
subset, the absence of an apparent trend, 1nsufficiert 
length in tke time series to exhibit seasonality, and the 
uncertainty as to which particular moving average nrodel 
would Lkest describe the data patterns were taken inato 
consideraticn. Twc models were selected for evaluation 
kased on avisual irspection of the autocorrelaticns and 
Fartial autccorrelaticns. 


a. First Order MFcving Average Model with the form; 


¥, = (1- 6,8) er (egn 3.5) 


rk. Seccnd Order Fcving Average Model with the fcrn: 


Yr =(1- OF OF) Cy (egn 3.6) 


Where € 1s the error tern. 


C. AMCLEL EVALUATION 


The models develcped were evaluated utilizing the 


Minitak seneral furpcse statistical computing system ccumand 
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NARIMA" (Autoregressive Integrated Moving Average).2 In 
crder for a model tc be ccnsidered adeguate the residuals 
(errcrs) of the estimates generated by the model wculd be 
required to be randca (unccrrelated) with a mean of zero. 
If each cf the models developed for each specific data set 
satisfied tris requirement, the sum of sguared errors and 
the I-ratics for eack model wculd be examined to determine 
the mcst apfropriate model. The results of this evaluation 


are ccntained in Table I. 


EF. RESUITS OF THE MCLEL EVALUATION 


As evidenced by table I, both models consicered 
satisfied tke reguirezent of random residuals. There exists 
little difference between the two nodels to suggest which is 
the acre appropriate, although confidence in the seccnd . 
model is reduced by tke low TI-ratio for the MA(2) operatcr. | 
In this case the prircirle cf farsimony3 was invoked and the 
selected mcdel was tke MA(1) with the forn; 


Y, = (1-.644B) (14. 448B12) ea, (egn 3.7) 


(- 101) (- 1€z) std. error 


<The ote eee and notations used oy this command, fcllow 
the cnes developed by Box and Jenkins. The values of and 

are détermined utilizing a nonlinear least squares al§crithn 
develcped ay MaxrGguart. The AFIMA command fits non~séasonal 
and se€éscnal modéls to atime series. with input consistir 
cf a Sstcred time series and information as to the degree o 
the AB, MA cr ARMA mcdel to be fitted [Ref. 9}. 


3The principle of esac cautions to attemrt tc use 
the smallest possible nhumber of parameters which represent 
the mcdel adequately [Ref. 10}. 
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ze Cetcher 1978 to Februame 1962 


The two models selected for this data sukset were 
Cun with results displayed in Table I. The residuals for 
koth models met the randomness reguirement, however the low 
I-ratics fer the non-seasonal coefficients in both mcdels 
was less than encotraging. As a result of these low 
T-catios, another model, SMA(1), was attempted. This srodel 
has tke form: 


Y, = Geauts @,,212) €, (egn 3.8) 


An €xaminaticn of Takkle I for this data set indi- 
cates little to distinguish one model from another in terms 
cf mcre adeguately describing the data patterns. Again the 
rule of parsimony was used. The model selected was the — 
SMA (1) mwcdéel with the forn: 


Y = 42.46 + (14+.77€B12) e, (egn 3.9) 


(1.78) (o2Cs) std. error 


3. Earch 138 


Heré, again, each model selected for evaluation 
satisfied the reguirement fcr random residuals. The sSignifi- 
cant difference tetween these two models is in the residual 
sum cf sguared errors, where the MA(2) model is clearly 
supericr to the MA(1) model. Also encouraging is the sicgnif- 
icant T-ratio exhibited by the MA(2) coefficient in the 
second mecdel. The srcdel selected for forecasting [furfoses 


kaSs 
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Y = €Z.$& + (1-.25CB-.745B2)e, (eqn 3. 10) 


(diez 3) (.2C&) (.208) std. error 


Fo FCRECASTING WITH THE SELECTED MODELS 


Tartle II and figtres 3.12 through 3.15 represent the 
results cf the fcrecasts of the selected models. The actual 
values fcr the net nuaker of target cohort contracts signed 
and tke forecasted values are displayed within each tatle 
secticn. The 95 percent limits displayed indicate that cne 
should re certain that the actual value will fall Ltetween 
these twce values 95 fercent of the time. In figures 3.12 
through :.15, the forecast results are displayed graphi- 
cally. The actual net contracts signed are dipicted ry the 
letter "A", the forecasted values by the letter "F" and the 
upper and lcwer 95 percent confidence limits by the letters. 
"U" and "L" respectively. 

Takle JI2Z is divided into four sections. The first 
secticn, ccrrespondics to figure 3.12, is a forecast fcr six 
fericds ahead fer tke period October 1983 to March 1584 
kased on the October 1978 to September 1983 time series 
utilizing the seasonal] MA(1)D model previously selected for 
this time series. With the exception of the March 1684 
forecast, all actual values fall within the 95 fercent 
confidence limits, however, these confidence limits are too 
wide to ke cf much fractical value. As evidenced [Ly figvure 
Sell, the model dces generally capture the pattern of 
enlistments for the fcrecasted period, although the degree 
of the dé€crease for March 1984 is not well represented. the 
variaticn in the forecasting errors, ranging from a lew of 
#5.0 fercert to a higk of +67.4 percent are not particularly 
encouraging. 


3 / 


In the second section, ccrresponding to figure 321379 
forecasts are conducted on a oponth to month basis utilizing 
the seguential* forecasting methcd. For example, the initial 
forecasts fer this data set were conducted utilizing 60 data 
Foints. When the 61 st data foint became availatle, the 
Farameters cf the same model specification were reestizated 
uSing 61 data yoints anda forecast for period 62 was 
conducted. Forecasts were ccnducted in this manrer for the 
Fericd Cctoker 1983 tc March 1984. As evidenced Ey section 
Two of the table, fcrecasting with this method froduces 
supericr results to fcrecasting multiple periods ahead. For 
the ucst fart, forecasted values well represent the actual 
values, although here again the model fails to adeguately 
represent tke sudden cecrease for March 1984. 

In sections three and f£cur of Table II, corresponding to 
figures 3.14 and 3.15 respectfully, the forecasts cf the 
MA(2) ucdéel developed for the March 1982 to SeptemEer 1583 . 
time series are examined. Section three, as in secticn cne- 
cf the takle, is a forecast for six periods ahead. the 
obvicus prcblem of attempting to forecast future fericds 
with a nxren-differenced MA model is the models tendercy to 
forecast the mean cf the tine series after only a _ few 
pericds, precluding any meaningful evaluation of these fcre- 
casts. Ckserving tte first two forecasts for the feriod 
Cctoker 19&€3 to March 1984, the results are not very encour- 
aging. The errcers fcr these two periods are much toc large 
to pe cf any practical application. 

In s€ction four the forecasts were conductead with the 
seguential forecasting method utilizing the March 19€&2 to 


Septemter 1683 time series. With only one forecasted value 


As soon as the value of the next data_ point Lecores 
availakle, all forecasts for future eriods are ufdated 
utilizing reestimated parameter values of the ARMA model, as 
opposed “tc the. adagtive method in which the ARMA model 
parameters remain urchanged. 
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falling within ten percent cf the actual value, and witk the 
forecasts téeing condtcted only cne period ahead, the sodel 
would aprear to have extremely limited usefulness. 

Cf tke two mcdels develcred, the Seasonal MA(1)C nodel 
develcrped fcr the Octcker 1978 to September 1983 time series 
would seem to be the most useful in forecasting future net 
values of high quality recruits. However, as evidenced Ly 
Table II and Figures 3.12 and 3.13, sudden major swings in 
the values cf contracts signed are not adeguately predicted 
Ey this aodel. In Chapter Four a causal model utilizing 
regressicn analysis will be examined for applicability, and 
to determine if sudden shifts in the dependent variakle will 
ke adequately descrited by a causal nodel. 
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MOTEL EVALUATION RESULTS 


Octoker 1878 to September 1983 
MA(1)D MA(2)D 

Residuals: Ran dcn RANDOM 

Means #0. 8&0 +1.03 

Std Dev: 8. &9 ~o6 
Significarce Level 

for Mean: Zemsiz 2-26 

Resicual 

Sum of Sguares; 4615.61 4416.89 

T-Raticos: 

MA (1 +6. 39 3.46 

ua tz) NA e472 

SMA (1 -2. 76 - 3.01 
Octoker 1878 to Febrvary 1382 

MA (1) MA (2) SHAG) 

Residuals: Ran dcnm Randon Randcn 
Mears +0. 76 +Q.73 +Q0.75 
Std Devs Ga-27 6.35 6.37 
Significance Level 

for Means 1. 96 1.98 2200 
Resicual 

Sum of Sguares;: 159 4.E&5 1634.77 1646.43 
T-Ratios 

MA ] -1.05 -0.91 NA 
SMA 1 =33°52 -3.73 -3.83 
Constant: 42.37 42.43 Y2.4E 


Actual mean for times series: 42.83 


ma MA (2) 

Residuals: oe Randon 
Means 2 40.42 
Std Lev: 8. 48 7.13 
Significance Level 

for Féan: 3. 89 Sect 
Resicual 

sum of Squares: 1294.39 918.04 
T-Rati1cs: 

a #0. 54 +1.20 
MA (z NA +3.59 
Constant: 62. 47 62.95 


Actual néean for the time series: 62.53 
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TABLE II 
FORECAST RESULTS 
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IV. THE REGRESSION MODEL 


A. EASIC ASSUMPTIONS 


It was assumed tkrat the forecasts generated with the 
ARIMA model presented in Chapter 3 could be improved ufen by 
develcping a model tshich tock into consideration causal 
relaticnshirs. The factors assumed to most actively irflu- 
ence tke net number cf high quality nonprior service males 
cpting te jcin the Navy was ccenjectured to ke: numker of 
Froducticn recruiters, scme measure of unemployment, size of 
the werk force, military/civilian pay ratio and recruiting 
goals. 

The utilization cf the regression model reguires tkat 
four tasic assumfgticrs be made. First, that the dependent 
variakle, net contracts signed, is linearly related tc each 
of the inderendent variables. If the relationships are not 
linear, then regression analysis cannot accurately be 
applied to the froblen. Second, the variance of the resi- 
duals (errors) cf tke regression eguation remains ccrstant 
cver tke tite series. Third, that the residuals are indefpen- 
dent cf cné another, that is any given residual shculd not 
ke a functicn of the error terms preceeding or follcwinrg it 
in the tite series. The existence of such a relationshif, 
called serial (auto) correlation, implies that either an 
lmportant independent variatle has been omitted or the wreng 
functicnal form has teen used in the regression eguaticn. 
Generally fcr a data set of the size used in this study, a 
Turbir-Watson statistic value retween 1.5 and 2.5 implies a 
lack cf serial correlation. The final ftasic assumpticn is 
that tke residuals are normally distributed. 


WE 


E. VABRIAELES CONSIDEBED FOR ISCLUSION 


AS Freviously defined, the dependent variarle is the 
total mnumker of cortracts signed per month by onenfrior 
service tEigh school graduate males in mental grceurs [I 
through III uppfer within the Albuquerque, New Mexico 
Recruiting CListrict less the numker of Céelayed Entry Prcgran 
(DEP) attrites of the same cchort during the same aucnth. 
This cata was obtained from Navy Recruiting Command for the 
feriod Octoker 1578 tc Septemkrer 1983. 


2. Ibe Number of Producticn Recruiters: RCTRS 


The values fcr this variable are the number of Navy 
Recruiters who were actually on duty in Navy Recruiting 
Cistrict Alkuguergue during the feriod of the time seriés. 
They Lépresent actual Canvassers and exclude staff . 
perscnnel. This data was crtained from Navy Recruiting 
Command. It would seem to be logical to expect an increase 
in the nunter of froduction recruiters to result in an 
increase inthe nugzter of contracts signed, subject to 
effects cf targinal jsreductivity. 


3. Numkers c£ Uremployed: UNEMPLD 


a == = oo ase ae EE 


The values fcr this variable represent the numbers 
cf people determined to be unemployed by the JU. Sic 
Tepartment of Labor. The values of this variatle were 
divided -y 10,000 so that the data could be manipulated mcre 
€asily. These values were divided by 10,000 so that the 
data could fe manipulated more easily. These values were 
ruilt up from a county level and represent the seasonally 
adjusted number of uremployed within the target recruiting 
district as a whole. No breakdown by sex or age grourf was 


availatle. This data was oktained from Navy Recruiting 
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Commard. It was assumed that an increase or decrease in the 
numbers of uneagplcyed would result in a corresfpending 


increase or decrease in the number of contracts signed. 


The values fcr this variable represent the nuuakers 
in the werk force, eugloyed and unemployed, as determined by 
the U.S. LCepartment cf Labor. The values of this variatle 
were Eruiljt up frem a county level and represent the seascn- 
ally adjctsted work fcrce in the target recruiting district 
in tctal. This data was oktained from Navy Recruiting 
Connanrd. As the values for this variable increased or 
cecreased, it was expected that the numbers of ccntracts 
Signed wculd increase or decrease correspondingly. 


5. Military/yCivilian Pay Batios PAYRATIO 





Ike merits cf any ccnstructed military / civilian | 
Fay ratio could be argued indefinitely with no concrete 
results as to a most meaningful measure. For the furfoses 
of this study, the military component of the ratio was 
calculated as monthly tase pay for an E-1 with omecre tkan 
three mcnths service. The civilian component utilized was 
average weekly earnings of froduction workers in the state 
cf New Mexico multiplied by four. The average weekly cearn- 
ings figure was obtained frcem "Employment and Earnings" 
published Ly the U.S. Department of Labor; Bureau cf lator 
Statistics. Intuitively, an increase in the gfilitary/ 
Civilian pay ratio wculd result in an increase in ccntracts 
Signed. 


6. Eercent Unemployed: FPEFCUN 


The values of this variaktle are UNEMPLD/WKFRCE. This 
variakle represents the percentage unemployed in the target 


recruiting district. This variable was included in the study 
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to determine if the percentage of unemployed within the werk 
force ucre accurately predicted the number of contracts 
signed tkan the crdiral value cf unemployed. 


7. Change in Un¢easgloyment: DELTAUN 


The Values for this variable represent the fBonth to gonth 
changes (+t /-) of the variakle PERCUNE for the target 
recruiting district. This variable was included in the 
study to determine if changes in the unemployment rate could 
signal ccrresponding changes in the number of contracts 


signed. 


€. Unensployment katio: UNEMRAT 





The values of this variable refresent a ratio of the change 
in uréemgleyment fren month to month within the target 
recruiting district as expressed by the following fornula: 


UNEKELD in month T ~- UNEMPLD in month I-1 


UNEMRAT = a rn ee eee 


UNEMELD in ponth T-1 


It was hcped that this variable could capture the effects of 
roth tke percentage unemployed and the change in unenflecy- 
tent cn a mcnthly basis. 


$- Coals = Quotas 


Unfortunately, there was insufficient data cn goals 
for the tarcet cohort to examine goals for inclusion in the 
regreéssicn equation. Intuitively, goaling would seem to play 
a key rcle in determining the number of contracts signed. A 
recruiting district which had achieved target guotas for the 
month wotld be tempted to delay additional accessiors until 
the fcllcwing month to helf insure the attainment cf goals 
cn a Ccntinuvuing basis. Additionally, goals would seem to 
mask the supply function of recruiting making the study of 
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the supfly protklen much mcre-'dirficultwc sre 1s the 
hypotktesis of this author that a detailed study cf the 
effects cf goals on enlistment contracts signed would indi- 
cate a very high pcsitive relationship bewteen these two 
variatles. This 1s ar area which demands increased attention 
and study. 


Cw. TEE KEGHRESSICN ACCEL DEVELCPED 


In determining the final choice for the explanatcry 
variakles, it was recessary to determine the lag or lead 
effect, if any, that was exerted on the dependent variatle 
ky each independent variable. For example, an increase in 
the numker of production recruiters may not affect the 
number cf ccatracts signed for several months, assuming that 
there is a learning curve associated with the pcsiticn. 
Similarly, an increase in the unemployment rate may not 
affect erlistments fcr several nonths. All of the indepen- ~ 
dent variatles frevicusly described were given a lag and 
lead cf frcem one to ix months and simple regressicnse were 
conducted tc determine which increment of lag or lead best 
explained the variaticn in the number of contracts signed. A 
variarkle was allowed to enter the equation only once. That 
is if, fer example, PERCUNE evidenced a significant influ- 
ence cn CNIFCTS laggec¢ one month and lagged two months, orly 
the lag with the most significant coefficient of detergina- 


tion, K-sguared, was included in the regression equation. & 


_ Siegel and Forack [Ref. 2 provide an excellent dicus- 
ee os the interaction betwéen goals and the recruiting 
effort. 


6Tbe decision to utilize a Saas ad or. lead for each 
variakle was made pricr to ccnducting the initial regres- 
Sion. dHcwever during the the analysis, all variakle lags 
and leads with a T-ratio greater than 2.00 for a simfgle 
regressicn were fitted into a multipfle regression. The coef- 
ficient cf determination of this multiple regression was not 
Significantly different frem the coefficient of deterfina- 
tion for the multifle regression utilizing a single rest fit 
Jag cr lead for each variable. 
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A multiple regressicn mcdel utilizing the lag cr lead of 
each variatle with tte highest coefficient of determination 
was ccnstructed, and a regression conducted on the dependent 


variaktle net contracts signed with results as follows: 


= -42.6 + .405 RCIFS1 + €.0 UNEMPLD1 —- 1.0 WKFRCE1 
+ 256.C PAYRATO4Y —- 241.0 FERCUNE2 + 28.2 DELTAUN 


+ 1€.8 UNEMRAT2 + a (egn 4. 1) 
where: 
RCTES1 = numter cf producticn recruiters lagged 1 mcnth 


UNEFFLCT1 = unemploygrent lagged 1 month 

WKFECE1 = numters in the werk force lagged 1 nonth 
PAYRAIC4Y = military / civilian payratio lagged 4 menths 
PERCUNE2 = percent unemployed lagged 2 months 

DELTAUN = change in unemployment with no lag 

UNEMBAI2 = unemployment ratio lagged 2 months 


a = residual 

ST. DEV. I-RATIO= 
COLUMN COPE LCIENT OF COEF. CORE Sa be 
COKSTANT -~42.6 158.9 = Oneee 
RCTES1 0.4051 0.2668 ie 
UNEMPLI1 5S] 17.81 0.33 
WKFRCE1 OS 1.187 =O. 43 
PAYRATICY Zoe? 96.18 2. €6 
PERCUNE2 ~2z41 2498 =e 10 
DELTAUN 28423 O26 iia 1D 
UNEMRAT2 16 .&1 ie o2 1. 46 


THE Si. LEV. OF Y AECUT THE REGRESSION LINE IS 
s= 8.030 


B-SCUAREL 62.2 PERCENT 
ER-SCOAKREL = 56.7 PERCENT, ALJUSTED FOR D.F. 
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ANALYSIS CF VAnava ee 


DUE 10 DF Ss MS=SS/DF 
REGRESSICN 7 £097.30 728.19 
RESILUAL 48 2094.84 64.48 
ICTAL 55 €192.14 


Those variatles sith negative T-ratios were €xcluded 
from consideration in the final regression model as having 
counter intuitive results. The regression was ccnducted a 
second time utilizing the variables: RCTRS1, UNEMELI1, 
FAYRATIO4U, LELTAUN, arc UNEMRAT2. The results of this seccnd 
regressicn were examined and those variables having a 
I-ratio’” less than 2.00 were €liminated from consideration 
in the firal model. A third multiple regression was tken 
conducted with the fcllowing results: 


=-61.E+4. UNEMPLD1+192.PAYRATC4U#+19.8DELTAUNta (egn 4.2) 


bhere ; 
OUNEFEILIIC1 = unegrplcysaent lagged 1 ponth 
PAYRAICY = militaryycivilian payratio lagged 4 months 
DELTIAUK = Change in unemployment with no lag 


a = residual 
STs) DEV. I-RATIIC = 
CCLUMN COEFFICIENT OF COEF. COEF/S.ue 
CCNSIANT -61.84 354.18 = 16 
UNEMELD1 3.670 0.545 6.€1 
PFAYRATO4S 193.2€ 20a) ze 14 
CELTAUN 19.755 9.410 Ze tt 


THE ST. LEV. OF Y ABCUT REGRESSION LINE [5 
Sa eat Vz 


7A high T-ratio indicates that the independent variatle 
is imgpertant in explaining the value of Y. The higher the 
I-cratic, the more unlikely that the b coefficient 1s a 
randcr variation frog zero. 


aie 


58.2 PERCENT 
Pomc eERGENIT, ALJUSIED FOR D.F. 


Peo GUAR EL 
E-SCUABREL 
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ANALYSIS CF VARIANCE 


DUE TC DF SS MS=SS/DF 
REGRESSICN 3 4770.27 1590.¢9 
RESICGAL 52 - 2421.88 65.81 
TOTAL 55 (8192.15 


The signs of the coefficients of the explanatory vari- 
ables are what one weculd exfect intuitively. As unemplcyment 
increases, so does the numtéer of contracts signed. An 
increase in the military/fcivilian pay ratio will cause an 
increase inthe nuater of contracts signed, and as the 
percentage unemployment changes, the numbers of contracts 
signed change accordingly. 

Cne cf the mere surprising results of the regression was 
the exclusicn of the variable ECIRS, representing the numter 
cf prceducticn recruiters. When forced into the regression 
egquaticn, the coefficient for this variarle was never 
Significantly differed from zero. Experience and intuition 
would indicate that recruiters play an important role in the 
numbers cf high guality nonprior service males enlisting in 
the Navy. 

There may be several explanations for the results cf the 
regressicn in regards to recruiters. In €Xamining the tine 
series slots for CNTECTS and RCTRS contained in Aprendix B, 
it is evident that cver the approximate last half cf the 
Feriod tre numbers cf contracts signed is increasing as the 
rumber c£ producticn recruiters is decreasing. It is 
Fossiftle that as tke Navy gains more experience in the 
selecticn cf recruiters, better recruiters are placed with 
the recrtiting districts, resulting in higher individual 
recruiter froduction. Also, it is possible that the various 
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services compete with each other for the target cchcrt with 
approxzingately egual determination thus canceling the 
apparent overall influence cf the recruiters. It is also 
fossikle that the tarcet cohort is not as easily influenced 
ky recruiters as "lcwer quality" recruits, as the tarcet 
cohort sculd seem tc have a greater selecticn of civilian 
cccurpaticnos. The individuals in the target cohort may have 
alreacy made the decision to enlist based on €ccremic 
factcrs fFricr to contact with a production recruiter. And, 
it is fessitle that recruiters really don't have a signifi- 
cant igfact when comysared to the impact of unemployzert and 
compensaticn. 

The studies discussed in Chapter Two found little 
Significance between the nusbers of production recruiters 
and the nugber of erlistments. Perhaps the services in 
general and the Navy in farticular have passed the rfoint of 
decreasirfg returns as far as recruiters are concerned. There 
way ke scme optimal level for the number of freduction . 
recruiters such that additional production recruiters cver 
this level are not beneficial in enlisting additional 
members cf the target cohort. It may be more cost effective 
to invest in advertising than in additional recruiters. The 
area cf costs and benefits of recruiters versus advertising 
1s an area which deserves additicnal study. 

The exclusicn of the variables WRKFRCE, PERCUNE and 
UNEMRAT is not as discouraging. As evidenced in the regres- 
Sion macdéls presented, when these variables where included, 
there was an adverse impact on the T-ratio for the variarle 
UNEMFID1. Since the values of the variable UNEMFLD are 
included in the variable WRKFRCE, and the values cf the 
variakles FERCUNE and UNEMRAT are constructed from the vari- 
able UNEMPLD, multicclinearity among these variables may be 
a factor, and is sucgested Ly the regression results. the: 
effects of uneuployzent andthe change in unempglcyment 


appear well represerted in the final regression equation 
nithcut including these three additional variables. 


C. KESILUALS EXAMINEL. 


Tke Curkin-Watson statistic for the regression was 1.52. 
As previcusly stated, a Durbin-Watson statistic between 1.5 
and 2.5 normally indicates a lack of serial (auto) correla- 
tion fcr a cata set cf the size used in this study. With 
three independent variables anda sample size of 60, a 
Lurbir-Watecn statistic between 1.69 and 2.31 would assure 
no s€rial correlaticn in the residuals. The cbtained 
statistic cf 1.52 falls in the inconclusive Lange. Since 
this test is inccnclvsive, additional study is warrented. 

The stardardized residuals and the raw residuals were 
Flotted in figures 4.1 and 4.z The value of a standardized 
residual was determined by dividing the residual ry its 
estimated standard deviation. Of particular interest here - 
is ary value outside ty- 2.00. In Figure 4.1 there are only 
two values that lie cutside the specified limits. Since tke 
cverwkelzing number of residuals lie well withizr these 
dimits, ne significart bias is indicated. In Figure 4.2, the 
residuals are plotted. The residual mean is -0.000£8120 
with a stancard deviation of 7.89. Examing Figure 4.z, there 
is ne ckrviots suggestion of a fattern in the data. However, 
the Durkin- Watson statistic of the firal multiple recres- 
Sion mucdel reguires that further analysis be conducted to 
detergine if serial correlaticn exists. In the fcllcwing 
chapter, tke residuals of the regression will be examined 
further. Ccmbining the regression model developed in this 
chapter with an ARIMA process as described in chapter 3 
would previde an imprcecved model, assuming serial correlation 
in the cesiduals. If no serial correlation exists, taen the 
regressicn gsodel alore would prove to be the superior scdel. 
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A. CCHEIBEIT MODEL DEVELOPED 


The auitiple recression sodel developed in Chapter 4 
satisfied all requirements except for the possibility of 
serial ccrrelaticn ir the residuals. In order to deternine 
if in fact a serial correlation problem existed and, if so, 
to attemrt to correct this frceklem, the residuals were exan- 
ined further. The hypothesis was that 1f£f an ARIMA rodel 
could ke developed fcr the residuals, then a serial correla- 
tion sroklem existed. However, this same model could in turn 
ke applied as a correction tc the regressicn model for more 


accurate forecasts. 
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Figure 5.1 RESIDUAL AUTOCORRELATIONS. 


oe 


The multiple regression model residuals are displayed in 
Figure 4.2 The autccciIrelations are plotted in Figure 5.1. 
In this figure only the autccorrelation at lag 12 falls 
cutside t/- 2 standard errors. However, no physical inter- 
pretaticr, such as seasonality, can be attributed to this 
spike, and it can be interperted as not being significant. 
Since tkere is no clear indication of a nonstaticnary 
pattern in the data, a white noise model was indicated as 


most afgfrorriate. This model has the forn: 


fe = a, (egn 5.1) 
wheres 
a, = tEe residual value at time t 


a random shcck at tise t 


as 


This indicated tkat a godel could not be develcrped for 
the residuals and, uken the attempt was nade to fit varicus 
ARMA todels to the data, no AKIMA model could be found that 
was afpprepriate for the nondifferenced residual time series. 
This strengly suggests that nce serial correlation existed 
for the residuals cf the regression model develored in the 
Frevicus chapter. 

Reexagining Figure 5.1, there is a slight indicaticn of 
a trerd in the data if the general shape of the plct between 
lags 1and 13 are cbserved. In order to explore this 
further, the residual time series was differenced once, with 
the resulting autoccrrelations plotted in Figure 5.2 . 
Examining this figure, the ixzmediate impression is that the 
data has teen over differenced. This 1s suggested [cy the 
large néecative spike at lag 1. Differencing a staticnary 
series frecuces another staticnary series. However, the 


a 


model gererated ry tte over differenced series will be ncre 
complicated than a acdel generated by a stationary series 
cbhtained with the mirigum amount of differencing [Ref. 10]. 
At this fcoint it was decided tc continue with the attempt to 
construct a combined model, even though it was realized ttkat 
the pessibility was small cf constructing a combined nodel 
which would prove superior to the regression nodel. 
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Figure 5.2 DIFEFEFREHCED RESIDUAL AUTOCORRELATIONS. 


Ike partial autoccrrelaticns of the differenced residual 
time sé€ries are displayed in Figure 5.3 . Based on a visual 
inspecticn cf the autccorrelaticns and partial autocorrela- 
tions a second order autoregressive (AR2) model was 
initially selected fcr testing. However, the ARZ model 
Froved nct to be apprcfriate. Since any moving average (MAq) 
Frocess can be cewritten tec form an AR process, the next 
step was tc test a MAz model on the differenced time series. 
The tested model had the foram: 
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pee = /\2, = (1- © B- OF?) €, (egn 3.2) 


Rhere Zy is the residual value from the multifle regression 
model at tine T and €, 1S the error term for the MAz acdel. 
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Figure 5.3 PARTIAL AUTOCORRELATICNS. 


TAELE Iil 
RESILUAL MODEL EVALUATION 


Residuals of models: Random 

Mean: 0.34887 

std Dev: reise 
Significance level] 

fcr means: 2-00 

TYPE ESTIMATE SI. DEV. MTI-RATIC 
MA 1 ~E472 0.1324 4.89 

MA 2 0.=C60 0.1371 Ze2o 


Tarkle III suggests that the model is appropriate for the 
residual tine Series. The ARMA model for estimating futtre 
errors cf the regressicn modél then beccmes: 
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Aze = (1 -.€472 8 [acces sae (egn §.3) 


hhere /\ies is the estimated error of the multiple regres- 
sion ucdel at tise t and Zy AVA + Zee 


where; 
Zr’ = predicted value of the residual at time t 
[xi = change in tte residual] value from tine t-1 
Zry = value of the residual at time t-1 


letting YY, equal the predicted value of the tsultifle 
Cegressicn nwodel alore, the predicted value of the contined 


model, Wy, , then beccmzes: 


Rie Yio a (eqn 5.4) 
HcweveLr, 1p examining equation 5.3 and taking into 
consideraticn the standard deviation for the values cf Q, 


and 6, , it is ccrceivable that equation 5.3 cculd be 


factcred as: 


{\2.= (1-8) (140.353) €, (egn 5.5) 


Cancelling cut the differenced values, the eyuation LFeccmes: 


Z- = (1+0.35B) €, (eqn 5.6) 


Eguaticn 5.6 indicates that the time series was over differ- 
enced and suggests t&Sing a MA1 model on the nondifferenced 
residual tine series. However, this had previously Leen 
attempted was found rect to Fe affpropriate. To this fcirt in 
the analysis of the residuals of the regression medel, all 
tests ccrducted continue to indicate that the residuals are 
randogr and that no serial correlation exists. The only test 
remaining was tc ccmbine the regression model frevicusly 
constructed with the MA2 model constructed above and to 
compare the results cf each models forecasts. 


Ew TEE FORECAST COMEARISONS 


Ibree acdels have been developed in order to forecast 
net néw contracts sicned. The Eest test of each model weuld 
have teen tc forecast some mucnth or series of months subse- 
guent tc Sertember 1683, the last month of the time series, 
and tc ccugare the results. Due to the nonavailability of 
appropriate data this was not possible. As an alternative, 
the mcdels were used to forecast the last five months of the 
time series, May 198: to September 1983, since the results 
were knocwn and the data reguired for forecasting with bcth 
the pultiple regression and the combired model was avail-~ 
able. The sequential forecasting method was utilized for 
all tkree models with results as contained in Table IV 

As méasured Ly the root mean sguared error, the regres- 
Sion model is superior to the ARIMA model and slightly 
Supericr tc the comkined acdel. If the residuals of the 
multiple regression mcdel were serially correlated, then the 
combined model should have ccrrected for this proklem and 
Froduced fcrecastS superior to the multiple regression 
zodel. The forecast results serve to confirm that the 
Bultisle regression residuals were not serially correlated. 
The superiority cf tke regressicn model comes primarily from 
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TABLE IV 
FCRECAST CCMPARISONS 


MONTH ACTUAL ABIMA REGRESS CCMBINED 
MAY &3 59 60-1 (401.8% 63.3 (+07.3% 68.0 (+15 
JUN & €0 62.5 (#04. 8% 63.4 (+05.7% 55.8 (-00 
JUL &3 £6 78.6 (+39 .3% 67.0 (419.6% 68.2 (+22 
AUG &3 49 61.8 (+26. 1% 64.6 (+31.8% 57.7 (+17 
SEP &2 15 53.C (-29.3% 74.9 (-00.1% 64.8 (-13 
SUM CE SQU2RED ERROFS: 

ARIFA MODEL : 1141.46 

REGRESSICN MODEL: 294.42 

CCMEINEL MODEL : 412.06 
ROCT FEAN SQUARED EFFCES. { RMSE) : 

ARIMA MOLEL 2 16.9 

REGEESSICN MODEL: 9.93 

CCMEINEL MODEL : £10.15 





WHEFE BMSE = [yy ¥)2 
(rc - 1) 


Y, = actual value 
Y, = frecicted value 
n = 5 (number of crservaticns) 


the akility to relate sudden shifts in the dependent vari- 
able tc measures cf unemployment as exhibited [ry the 
predicted values for September 1983. The ARIMA process of 
the ccmktined model has a dampening tendency on the 
Fredicticns when there are sudden shifts in the variatrles. 
The regression process alone would then become the preferred 
todel during a period of econcaic fluctuation because of the 
exhiltited superior akility to capture these sudden shifts in 
accessions. 

Ite difficulty in forecasting with the multirle recres- 
Sion acdel and the combined snodel is that one must also 
forecast future values of the change in percent of uneaplcy- 
ment, the variable DEITAUN. Even with forecasted values of 
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CELIACN, the forecast horizon is limited to one month due to 
the cne macrth lag fcr unenployment,the variable UNEFPID, 
unless future values for that variable are also fcrecasted. 
For any period beycrd one pnonth, the forecast fcr ret 
contracts signed ther deperds on the accuracy of these two 
additicnal forecasts, decreasing the face validity cf the 
predicticn due to comgounding negative effects. The attrac- 
tiveness of the Box-Jenkins arpfrroach is that only the values 
cf the time series are taken into consideration in the fore- 
casting process. However, the basic assumption of continuity 
in the pattern of tke data must hold generally true if the 
results are to be of value. 


Aw. SCUMBAEY 


Tkree mcdels utilizing two distinct analytical mneéthcdcl- 
cgies have Teen freserted. The Box-Jenkins methodology was 
applied to a monthly time series of net enlistments in the 
Albuguergue, New Mexico recruiting district and a rodel of 
the underlying data yatterns developed. A multiple recres- 
Sion acdel attemaptirg to e€xamgine causal relationshifs sas 
derived tased on intuition and the statistical significance 
cf the inderendant variables. A combined time seriesycausal 
model was déevelored treating the residuals from the recres- 
sion sodel aS an ocriginal time series and applying the 
Eox-Jéenkins technigue to them. Forecasts were conducted with 
the tkree mcdels and the results examined. 

A ccuparison of the root mean sguared error of the resi- 
duals cf the three models indicated that the tgultifle 
regressicn rodel was superior tc both the Box-Jenkins sodel 
and the ccmbined sacdel. Accuracy favored the amultifle 
regreéssicn approach, however the need to develop fcrecasts 
for twc principle irdependent variables limit its" useful- 
ness in fcrecasting. A need to forecast extended fericds 
would favor the Box-Jenkins approach since there does not 
exist a reguirement tc include the independent variarles of 
the gwultiple regressicn model. Both the multiple regression 
model and the Box-Jenkins methodology can serve useful 
Furposes when the irherent limitations of each are taxen 


into ccnsideraticn. 
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E. IBFELICATIONS OF IEE RESOLTS 


The gaultiple recgression results suggested three vari- 
ables were primarily responsiftle for the variaticr in the 
net nugker cf contracts signed. These were the ordinal value 
of unemplcyrent, the change in percent unemployed anc the 
tilitary / civilian fay ratio. Of these three variatlles, 
the Lepartment of Defense (DCD) exerts only a partial influ- 
ence cver the military/civilian pay ratio. As _ aliuded to 
previcusly, no single measure of the military/civiliar fay 
ratic car be justified as a "best" measure of this relaticn- 
ship. Tre one freserted ia this study was felt tc te an 
adeguate estimation cf how young men making an occuyational 
choice view the relationshir retween military and civilian 
compensation. This ratio did not take into account = such 
forms cf compensaticn as medical benefits, enlistment 
konusé€s, ccmmisary and exchange privileges, special fays, 
and cther ftenefits. Fach individual places different values 
on tkese additional] forms of compensation depending on 
his/Eer cwn needs and desires. It may be that these cther 
forms cf ccmpensatior frovide the marginal benefit tkat sway 
a majcrity cf enlistees towards an occupation in the grili- 
tary. The effects of these additional forms cf compensation 
have Freen the focal fcint fcr many studies and their effects 
deserve ccntinued research. 

As fer unemploygent and the change in percent unea- 
Floyed, jlattle can re done by DOD to influence these vari- 
ables. Indeed, it ig unreascnable to assume that anycne 
would desire to place the economy of the nation in feril 
Simply te ease recruiting short falls. However, it is clear 
that cgené€ral economic factors play a pivotal role in the 
cccupaticnal] choice frecess and that the effects of changes 
in the €concmic indicators can have an immediate impact. A 
sudden increase in civilian jok market opportunities for the 
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target cchcort would bave an imagediate and adverse igpact cn 
Navy recruiting. During periods of general economic inpreve- 
ment, mcre resources will Le required to obtain sizilar 
rumbers cf high quality enlistees than duriny a_ fericd of 
ceneral €ccnomic decline. 

Remainirg unexplained in the analysis is the shift in 
the tize series from apparent seasonality over the first two 
thirds of the series to aprarent naonseasonality cver the 
last third of the séries. Seasonality 1n recruiting with 
Feak months in August and January can most easily be 
explained Ey the schccl year. Intuitively, a young man would 
not desire to graduate from high school in June and directly 
enter gilitary service. Or the average, one would expect 
him tc desire to regain among his friends for the Sunzer 
ponths, ertering tke service at the end of Summer when 
friends either return to schocl or enter the job market. The 
feak gucnth of January can Le intuitively explained Ey the 
Fotential recruits desire to remain among family and friends 
during tke holiday season pricr to entering the service. 
What is not readily explained is the virtual absence of this 
seascrality over tte last third of the time sé€riés. 
Rememnkering that the dependent variable 1s contracts signed 
and net accessions, what may explain the change in the data 
fattern is a change in the gucta system. Utilizing the CEP 
Fool, the potential recruit 1s currently able to signa 
contract with the Navy and delay entry into the service for 
a period cf up to cne year. This may have the effect of 
dampening the seasonal pattern exhibited by the first two 
thirds of the time series. However, this is fure conjecture 
as there was insufficient data available to conduct further 
analysis or draw any definite conclusions. 
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C. RECCBSENDATICNS FCR FUTORE STUDY 


Ite uwcdéels presented in this study should be refined and 
updated. The 60 data foint time series utilized is tco [rief 
to accurately depict cyclic flucuations in the data set. Ihe 
time series should re expanded as additional data fecints 
kecome available. Estimates fren nodels developed from time 
series cf adequate length, in excess of 100 data feints, 
could prceve beneficial in allecating quotas and recruiting 
resources. Regressicn analysis, examining causal relaticn- 
ships cn a recruiting district level, should ccntiaue ia 
crder that a better understanding of the effects that local 
econcmicse have on tke supply cf potential enlistees can be 
cbtained. Cne area cf potential intrest may te the apflica- 
tion cf leading indicator acdels, an extension of the 
Fox-~Jenkins methodolcgy, to the problem of predictirg the 
supply of recruits. 

Lue tc lack cf sufficient available data, several refin- 
ments in this study which the author feels are important 
could not be made. First, in the area of unemployment, an 
accurate [reakdcwn Ey sex, age, and ethnic group by 
recruiting district wculd have been of significant Lrenefit. 
To make an apprecialtle improvement in the understanding of 
the interaction of ilccal economic conditions on the supply 
cf pctertial enlistees this data will be necessary for 
future studies. Some sort of working agreement with the 
LTepartment cf Labor; Bureau of Labor Statistics would te 
necessary in order te obtain the data in a useful fern. A 
restructing of the recruiting districts would make data 
Gathering €asier. A realignment along state boundries would 
take close coordinaticn with state governments mere readily 
avialarle. There could be a cooperative effort between the 
state uremnployment cffice and the recruiting district. 
Althcugh scme states would be reluctant to act as a 
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recruiting agency for the militartye the central there would 
re to find empicyment for the states young people tc the 
renefit cf the state and the Navy 

AS previously aiscusseds the lack of sufficient data 
concerning guotas proved a drawback. There are three measm 
yres cf recruiting effort cver which the Navy has complete 
contrcd shich would seen to be inseperable in theict effects. 
qhese are the nuaber cf production recruiterse qguotase and 
jocal advertising: 1c study any one of these or to jnclude 
any cne cf these in a study t° the exclusion of the ethers 
would see8 to be of little yalue- pecords of adequate Length 
and detail should be kept iD order to study the jong tere 
effects cf these factcrs and cpecifically how they interact 
with each cther- It may ke peneficial to the recruiting 
effort t¢ gecrea se the number of production recruiters while 
gncreasing coals and joc al advertising: Howevel,s unles® the 
data 1s available to study, or pilot programs are jnitiateds 
there cab ke no clear aemonstration of the anterrelatica™ 
chips cf these yar1icus recruiting ef forts- without etrcnag 
evidence of the . pterrebationsh+ Ps between thesé three 


efforts. nce study of the supEly of first tera enlistees can 


anctreL interesting area which should be examined is 
peLlayed Entery progral (DEP) attrition: Le yould Le joter~ 
esting t¢ determine j£ an juproved job parket corresponded 
to an jocrease in the number of contract signers attenEting 
to nullify their ccntract- In some cases 4 new recruit has 
the pctential Lo remain in the DEP pool for up t9 ene yeat- 
The eccncpic opporturities for this potential recruit could 
vaL Y considerably during this years tine, and during ? 
general inprovement jn the econonye civilian job cpportunl- 
ties cculd pecoe availablé to the cohort member that did 
not exist when he/she bec ame part of the DEP pool- jhe 
erosicn cf the DEP pool should be studied a5 a ectential 


problem areae 
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Cf particular benefit would be a regional study encca- 
passing accessions into all four branches of the silitary 
and tke Coast Guard. A study of this type would permit the 
examiration of the interaction of the different recruiting 
efforts. This wculd assist in determining if an increase in 
the recruiting efforts of any one service served to increase 
the tctal number of accessicns, Or merely drew away peten- 
tial recruits frem one of the other services. Additicrally, 
Ey ccifaring the recicnal economy to total military acces- 
sions for the region, a clearer understanding of the effects 
cf changing econcmic conditicns could be obtained. 
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APEENDIX A 
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PRIMER CN TIME SERIES ANALYSIS 


Aw. TIBE SERIES ANALYSIS 


Excerpt where otherwise noted, this appendix is Eased on 
the werks cf Ss. Makridakis and S.C. Wheelwright. A tine 
series is acollecticn of cbservations made sequentially 
in tige. Examples occur in a variety of fields fror physics 
to physiclcgy and methods of analyzing time series consti- 
tute an impertant area of statistics. Statistical techniques 
for analyzing time series range from the straight forward to 
the very scphisticated. However, traditional means of tine 
series aralysis are frimarily concerned with decomposing a 
series intc the basic comfonents of trend, seascnality, 
cyclic chanses and otter fluctuations. 

The first step in analyzing a time series is tc flot the 
cbservaticns against time. Figure A.1is atime series flot 
of the sales figures for a certain engineering conpfany over 
successive months [Bef. 11}. This plot can be used to 
obtain sinuple descrirtive measures of the pain proferties of 
the sé€ries. For exauple, locking at Figure A.1, it can be 
seen that there 1s a regular seasonal effect with sales 


"hight ir the winter and ‘low' in the summer. The craph 
Suggests an upward trend in annual sales. For scme time 
serles, the variation is dominated by such ‘ckvious' 


features, and a fairly simple model may be _ rferfectly 
adequate tc describe tte variation in the time series. for 
cther time series, more sophisticated techniques will be 
required tc provide adeguate analysis. 

When ckservations are taken on two cr more variatles, it 
tay ke ¢-cessible to use the variation in one time series to 
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Figure A.1 Bcnthly Sales. 


explain the variaticn in another time series. This may lead 
to a deeper understarding of the underlying causal relaticna- 
ships in a process and permit accurate forecasts of future 
values of the variable of interest. This may in turn aid in 
the certrcl of that future value. 

Ike ability of a given technigue to forecast effectively 
in a syecific situaticn depends largely on accurately iden- 
tifying the patterns underlying the data and selecting the 
Froper technigue to handle them. All time series analysis 
technigues assume that the patterns underlying the data are 
constant over twc yeériods; the period over which the data 
was ccllected, and the period of the forecast. However, 
different technigues are available to fit a wide variety of 
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Situaticns. The factcrs that need to be ccnsidered when 
selecting a model are as follows: 

1. Tke tige horizor for decision making 

2. The pattern of tke data 

3. Type cf model dé€sired: tise series or causal 

4. Ike value of the forecast 

5. The accuracy that is reguired 

6. The ccuplexity that can ke tolerated 

7. The availability of historical data [Ref. 8] 


Ee. TEE FOX-JENKINS #ETHODOLOGY 


1. The Advantages of Box-Jenkins 


The Box-Jenkins methcdology is an efficient and 
Fractical procedure fcr handling time series and cther fore- 
casting situations in which a variety of complex fatterns 
exist. This methodolcgy can handle complex patterns of data . 
uSing a relatively well specified set of rules that de net - 
reguire that a mcdel re chosen on the basis of an analyst's 
experience, intuitive ability or theory. 

EFox-Jenkins is the gost general methed of 
apprcaching time-series forecasting. There is ne need to 
assugze iritially some fixed pattern to date. The approach 
kegins Ly assuming a tentative pattern that is fitted tc the 
cata so that the errer will be minimized. It additicnally 
Frovides explicit information to enable the user to judge 
whether the pattero tentatively assumed is correct for the 
Situaticn under study. If so, the forecast can be develosed 
directly and, Lie niGt, the Eox-Jenkins approach frcvides 
further clues for identifying the correct pattern. This 
Frocedere allows the user to arrive at a forecasting nodel 
that achieves optimization in terms of the Lasic pattern and 
Hinirizes tke forecasting error. The user is also sufflied 
with statistical infcrmaticn on the accuracy of the fore- 


casts. 
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Z. The General Afrroach 


In describing their apfroach, Box and Jenkins have 
develcred the schegatic diagram shown in ELGUEC \ he 
[Ref. 12). This apgrroach divides the problem intc three 
stages. In stage one, a specific model is suggested as the 
forecasting method best suited to that Situation. In stage 
two, that suggested acdel is fitted to the historical (tine 
series) cata aad the suggested model is checked to deternine 
its adequacy. If the model is not adeguate, the analyst 
returns to stage one, and an alternative model is prorpesed. 
When an adeguate model is identified, the development cf a 
forecast fcr some future time period is conducted in stage 
three. 


An autocorrelation ccefficient is similar to a 
correlaticn coefficient except that it describes the asscci- 
ation amcng values cf the Ssaze variable but at different 
time periods. The autccorrelation among successive values of 
the data is a key tccl in identifying the mest afprofriate 
model ccrresponding to the Easic pattern of the data. 
Correlaticn implies ncthing akout a change in one variakle 
causing a change in tke other variable. 

Ic further déegonstrate, we can construct an artifi- 
cial variakle from arcther variable by changing the time 
crigin cf the data. For example, dropping the first value 
cf ‘A* and letting tke second value be the starting value of 
"B', See figure A.3 . In this example variables ‘A* and 'B'* 
can Fe treated as tuc separate and distinct variables even 
though they came frcm the same data set. In this example, 
"BY has a time lac of one feriod from ‘At. This same 
apprcach can be expanded to create additional data sets 
utilizing as many time lags is as deemed necesSary for iden- 
tifyirg the appropriate model. 
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Figure A.2 Box-Jenkins Method. 


76 


—_ a> — a 2 ak oe, ee a Ee ae 7 ee Ee EES ee cee 


UN = ln Ola» 
(nd 2Wwinadm 


Figure A.3 Lagged Variables. 


We can ccnsider variakles ‘A* and "B* as two vari- 
ables and calculate tkeir ccrrelation coefficients. A coef- 
ficiert cf .80 between *A* and "B*", for example, would imply 
that successive values of ‘A* with one feriod (lag) -Fetween 
them are positively correlated with each other and terd to 
move in the same direction. Similarly, a coefficient of -.70 
ketween ‘A* and ‘B® wculd indicate that successive values of . 
"A* lagged cnce are regatively correlated and tend to acve 
in oggesite directicts. However, since variable 7 ES 
actually derived frog variable ‘A', Such an associaticn is 
called attc-(self-) ccrrelation. 

Autccorrelaticns provide significant clues atrcut the 

underlying pattern cf a data sét. In a set of ccapletely 
randcm data the autccorrelations among successive values 
will ke clecse to zere, but data values of strong seasonal 
andyor cyclical character will be highly correlated. 
Figure A.4 [Ref. 13] for instance, presents the autccorrela- 
tions cf different time lags of monthly temperature in 
Faris. These autoccrrelations reveal a strong seéascoral 
fattern. 
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Paris. 
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Monthly Teaperature 


Figure A.4 


It is this type of information gained from autcccr- 
relaticns tkat can [te utilized by the Box-Jenkins affEroach 
to identify the optinal forecasting model. Of particular 
jnterest is that the analyst needs to know nothing of the 
data sé€t or its fatterns to oktain its autocorrelaticn coef- 
ficients. These coefficients can be used to describe the 
Gata set and assist in identifying a tentative model to be 
fitted tc tke data. 

hKhat Figure A.4 implies is that temperature of 
months twelve periods apart are highly correlated. Although 
this is ktasically intuitive, if this were not fprevicusly 
known the autoccrrelations would be of Significant -rFenefit 
in suggesting an agfrropriate model to describe the data 
pattern [Ref. 13]. 


YW. Eox-Jenkins Mcdel Types 


= SEP gn ae a SS See Se PE a <= 


Ike Box-Jenkins methodology postulates three general . 
classes cf models that can generally descrite any tyre or 
pattern cf time series data: 

a. Autcresgressive (AR) 


b. Moving Average (MA) 
Cc. Hixed Autoreéegressive-Mcving Average (ARMA) 


An autoregressive mcdel (AR) has the forn: 


te = @OY,., +@DY,. +Di,, te-- +Dy, te, (egn A.1) 


where Yyzis the independent variable, high guality recruits, 
and Yr, olr2 «7.5 geeeetrp are “independent" varialles. 
In this case the irdependent variables are values cf the 
Same variable, but of previous feriods (t-1,t-2,t-3,...t-[). 
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Biden yy is the e€rror or residual term that refreserts 
crandogw events that are not explained by the model. The 
residuals (e€rrors) of the model should be white noise, tkat 
is they shculd have a mean of zero, a constant variance and 
ke unccrrelated cver time. 

The model described is autoregressive because it is 
Similar tc the regression eguation 


Y=aer, 3, +4 X, +b SX, +...+d, X +e, (egn A.2) 


except that: 


Xx, Tiles eo hoe Lae x, “VY yeccee 1,=1,_, 


and tkus the independent variatles are only lagged values of . 
the déeperdart variable with tise lags of 1, 2, 3,+<ay7men 
feriods. The autoregression mnodel is Simgilar in functicn and 
form tc the regressicn equaticn. However it is net afpro- 
Friate tc descrite all fossible underlying data patterns in 
time series analysis. Thus, the Box-Jenkins approach also 
considers tso other classes cf models. 
The Moving Average (MA) model has the forn: 


Y-=€,-0 €y, -O,e ¢2 ~@,¢ c-s Be Res (egr Ac3} 
where ¢€, is the errcr or residual and e,,, Cr. 1 Or-3 
seece€y.5 ale previous values of the error. This isgplaes 


that tke defendant variable Y,; , depends on previous values 
c£ the errcr term (€,, ,@zr. sr acces Seng ) rather than on 
the variarle itself. The future value of high quality 
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enlistments could be predicted by utilizing the errecr of 
each cf several past [redictions. In the same manner tkat 
autocorrelation among successive values of Y can te exan- 
ined, the autocorrelation among the successive values of the 
residuals can also be examined. 

The third Class of model considered ry the 
Eox-Jenkins methcdolcgy is a mixed model. The best descrip- 
tion of the pattern of data may be frovided by a tixed 


Frocess cf AR and MA elements. The general mixed mcdel has 
the fcr: 
Y¥,=OY,., *OY,, t--- tO 37 p te, - OS, -Oe,.5 °° - a rH4 (e€gn A.4) 


It is evident that tlkis model is simply the AR and MA acdels 
combired. It suggests that future values of high guality _ 
recruits depend cn beth past values of high quality enlist- 
tents and tke errors retween forecasts and actual values. 


se Identifying a Tentative Hodel 








It is possible to suggest some specific ARMA (f,q) 
model Ey e€xamining tke autocorrelation coefficients anda 
Similar set of parameters, the partial autocorrelaticn coef- 
ficierts. Partial autccorrelaticn coefficients are analcgcus 
to autcccrrelaticn ccefficients in that they indicate the 
relaticnship of the values ina time series to varicus 
lagged values of the same series. However, they differ frcn 
autoccrrelations in tkat they are computed for each tine lag 
after regzoving the effect of all other time lags in the 
given tise lag and on the original series. In essence, tkey 
show the relationship that exists for varying tize lags 
[Ref. 8]. 
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Ccmkinations cf autccorrelation and partial autccecr- 
relaticn ccefficienats, in the absence of randomness, can 
reveal the e€xact ARMA model. Randomness complicates the task 
kecause it influences the values of the autocorrelation 
coefficients and may cause them to deviate from their "true" 
pattern. This makes acdel identification more difficult, Eut 
generally there is ercugh information available to select an 
appropriate model. Even if there is a mistake at this foint 
in the aralysis, it will be found when a test of the rcdel 
is ccndtcted. Figures A.5 through A.9 demonstrate the 
kehavicr cf the thecretical autocorrelations (ACs) and 
partial autccorrelaticns (PACS) for each model, where "MA" 
denotes a moving average model, "SMA" denotes a seasonal 
moving average mcdel, "AR" denotes an autoregressive model 
and "SAR" denctes a seasonal autoregressive mcdel. 
[Ref. 10] 

Thus in crder to forecast a time series, the auto- . 
correlaticns and partial autocorrelations must be ccaruted . 
first. Scaetimes tke pattern of the computed autocorrela- 
tions and partial autccorrelations can be easily classified 
as cne of the theoretical ones described below. The identi- 
ficaticn cf an apprcsriate model is then relatively easy. 
Cther tines, some free associaticn and intuitive judgement 
1s reguired to be able to infer a pattern from the autcccr- 
relaticns, or mucre tkan one fattern may be suggested. In 
such irstances, a choice can be made based on gcodness of 
fit ard nuaber cf parameters involved. Once a tentative 
model has Leen made, the next step is to estimate the faran- 
eters. 





Assuming that the tentative model is an ARMA (1,1) 


its! gatktematical forgr is 


Yeape Ops Ca 2 <= (eqn A.5) 
in order tc utilize tkis model, the value of D and © must 
ke estimated. The seéeneral afpfrroach is to start with scage 
estimated initia] values for @ and © and then modify them by 
small steps while olserving the sum of the sgquared errors 
(SSE). This allows tke direction of change in and © that 
result in the suwallest SSE to be determined. Eventually, 
the @ and © correspordinog to the minimum SSE are fcund and 
used as the final estimates for the model. The next step is 
to test the adequacy cf the model. 


7. Testing the £cdel's Adequacy 





If the sodel is adequate, the residuals (differ- 
ences) Lretween the time series values and those estizated by 
the xcdéel mest be random (white noise). The autocorrelation 
coefficients for these residuals are examined. If ncene of 
the autccorrelation coefficients of the residuals are 
significantly different from zero (plus or minus twe stan- 
Gard e€rrcers), the errors are assumed to be white ncise and 
the mcdel is adequate. If the autocorrelations are not 
randca, tke analyst must return to step two and select 
another rodel. 
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Figure A.5 AR (1) Model ACS and PACS. 


84 


0 - 


Autocorrelations Partkd Autocorrclitions | 
a (d) ~| Ch) 
AR (2) 
d,= +06 () Q 
lag lag 
6,= +0.2 
a = | 
be (¢) na (1) 
AR (2) 
b, = -—0.6 0 0 
lag lag 
6,= +0.2 
=I | 
+ 
a (f) (m) 
AR (2) 
b, = +0.75 O O 
lag lag 
d,= -0.5 
al = 
> 
+ | (g) l (n) 
AR (2) 
b,= -0.8 Q) () 
lay lag 
Be =06 | 
1 -1 


Figure A.6 AR (2) Bodel ACs and PACs. 


eS 


+ | (b) 
MA (1) 
0 
4,=-08 lag 
—} 
a (c) 
MA(1) 
() 
H = +08 lag 


a en 


+ |} 


(i) 


lag 


=1 


ool 


()) 


a ER a a 


Figure A-7 MA (1) Model ACS and PACs. 
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Figure A.8 
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Figure A.9 Bixed Arma (1,1) Model ACs and PACs. 


8é@ 


Autocorrelauons Span = 12 Partial Autocorrelations 


bd 
+(0.8 (e) 


x 
e 


(a) 
+ 0.4 
: 
12 -4 %© Jag lag 
-0.4 
-0.8 


(b) +0.8 (f) 


+0.4 


lag a 
4 


4 


-0.8 


(c) ao (g) 


+ 0.4 
lag lag 


-0.8 


(d) +08 (h) 


+4 


lag | 
-(04 


+ 


-O8 








Figure A.10 AB(1) 3 SAR(1) Seasonal Model ACS and PACs. 
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Autocorrelations Span = 12 Partial Autocorrelations 
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Figure A.11 AB(2) % SAR(1) Seasonal Model ACs and PACs. 
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Figure A.1z MA(1) X SMA(1) Seasonal Model ACs and FACs. 
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MA(2) X SMA(1) Seasonal Model ACs and FACS. 
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TIME SERIES PLOTS OF THE VARIABLES 
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